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Abstract 

We consider multi-antenna base stations using orthogonal frequency-division multiple access (OFDMA) and space division 
multiple access (SDMA) techniques to serve single antenna users, where some of those users have minimum rate requirements and 
must be served in the current time slot (real time users), while others do not have strict timing constraints (non real time users) 
and are served on a best effort basis. The resource allocation problem is to find the user assignment to subcarriers and the transmit 
beamforming vectors that maximize a linear utility function of the user rates subject to power and minimum rate constraints. 
The exact optimal solution to this problem can not be reasonably obtained for practical parameters values of the communication 
system. We thus derive a dual problem formulation whose optimal solution provides an upper bound to all feasible solutions and 
can be used to benchmark the performance of any heuristic method used to solve this problem. We also derive from this dual 
optimal solution a primal-feasible dual-based method to solve the problem and we compare its performance and computation time 
against a standard weight adjustment method. We find that our method follows the dual optimal bound more closely than the 
weight adjustment method. This off-line algorithm can serve as the basis to develop more efficient heuristic methods. 

L Introduction 

Multi-antenna base stations using orthogonal frequency-division multiple access (OFDMA) and space division multiple 
access (SDMA) can simultaneously transmit to different sets of users on multiple subcarriers. In OFDMA-SDMA systems, 
multi-user diversity allows an increase in the system throughput by assigning transmitting resources to users with good channel 
conditions. High data rates are thus made possible by exploiting the degrees of freedom of the system in time, frequency and 
space dimensions. OFDMA-SDMA is also supported by WiMAX and LTE-Advanced systems which are the technologies that 
will most likely be used to implement fourth generation (4G) cellular networks HIj El- 



Due to the increased degrees of freedom it is critical to use a dynamic and efficient resource allocation (RA) mechanism 
that takes full advantage of all OFDMA-SDMA transmitting resources |3|. The role of an RA and scheduling algorithm is 
to allocate the resources for transmission required to meet the quality of service (QoS) requested from upper layers. In this 
paper, we focus on resource allocation policy for an OFDMA-SDMA system supporting real time traffic with minimum rate 
requirements. 

A. State of the Art 

The combinatorial nature of the RA problem makes it NP-complete JU. For an OFDMA-SDMA system with a practical 
number of subcarriers, users and transmit antennas, it is thus almost impossible to solve the RA problem directly. Therefore, 
most research work focuses on developing heuristic and near-optimal RA algorithms. 

Traffic in the system can be divided into two main groups: delay sensitive real time (RT) services and delay-insensitive 
non real time (nRT) services. Early work on OFDMA-SDMA systems focused on solving the RA problem for only nRT 
services, where the objective was to maximize the total throughput with only power constraints and possibly minimum BER 
constraints. In Q the complete optimization problem was divided into a per-subcarrier user selection problem and a power 
allocation problem, which are both solved heuristically. A similar approach using Zero Forcing (ZF) beamforming was reported 
in ||5|. The work of ID, ||5| does not solve the complete optimization problem because of its computational complexity. Instead, 
it is separated into uncoupled subproblems that provide non-optimal solutions. For this reason, optimal or good approximations 
to near-optimal solutions are important to benchmark the heuristic algorithm performance. Several methods to obtain near 
optimum solutions have been proposed for the RA problem with nRT traffic only. For example, in ||6l genetic algorithms are 
proposed, while Q, [S], ||9l, ifTOl provide methods to compute a near-optimal solution based on dual decomposition methods. 
In addition to providing a benchmark, near optimal algorithms can also lead to the design of efficient RA methods as shown 
in ifTOl where heuristic algorithms derived from the dual decomposition methods are proposed. 

Several approaches have been proposed to solve the OFDMA-SDMA RA algorithm with RT traffic services. In some works, 
the design of the RA algorithm supporting real time QoS is intertwined with the scheduler design. The scheduler decides at 
each time slot which users must be served and sets their priorities in the RA utility function ifTTl . Priority can thus be given 
according to the current deadlines for RT services by increasing their weight in the utility function while achieving some degree 
of fairness for nRT services |[T2l . The utility function is used as the objective to maximize by the RA problem without any 
explicit minimum rate constraints for the RT users. Similarly, RA heuristic algorithms for RT services have also been proposed 
where the users are served in priority according to the packet waiting time and urgency to be served ifTJl . lfT4l . ifTSl . Note 
that with these reactive approaches, the minimum rate requirements are not modelled as hard constraints in the per-slot RA 
problem. Instead, RT users with poor channel conditions are backlogged until their delay is close to the deadline and then the 
RA reacts and allocates resources to them. This causes an increase in the average delay and delay jitter, which is not suitable 
for RT services. 

Another approach is to use constraints on the average rate deUvered to a user |fT6l . However, unlike the work presented 



in ifTTl where a near optimal solution is provided for the single antenna OFDMA RA problem with average rate constraint, the 
algorithm presented in |fT6l is a heuristic approximation. Note that with average rate constraints, RT users tend to be served 
when they have good channel conditions which can create unwanted delay violations and jitter. 

In ifTsl . an heuristic algorithm for the RA problem with proportional rate constraints is proposed. Although rate constraints 
are used on a per slot basis, the actual rate is relative to the total rate and does not offer a guaranteed minimum rate to RT 
users. 

Heuristic algorithms have also been proposed for the RA problem with minimum rate constraints 
|[T9l , II20I . For example, in |[T9| a subcarrier exchange heuristic is proposed to satisfy the minimum rates. 

B. Paper Contribution and Organization 

To guarantee the required QoS for RT users it is preferable to integrate their minimum rate requirements into the RA 
optimization problem rather than using reactive methods without explicit rate constraints. Some heuristics have been proposed 
for the OFDMA SDMA RA problem with minimum rate requirements but no optimal or near-optimal solution has been 
derived, as was done for the OFDMA-SDMA RA problem for nRT traffic. Having such a solution is important to benchmark 
heuristics and can also lead to the design of more efficient heuristics. The main contribution of this paper is therefore an 
efficient method that provides a near-optimal solution to the following OFDMA-SDMA RA problem for mixed RT and nRT 
traffic: for a given time slot, find the user selection and beamforming vectors that maximize a linear utility function of the 
users rates, given a power constraint and minimum rate constraints for RT users. The user weights in the linear utility function 
are arbitrary and can be the result of a prioritization or fairness policy by the scheduler We focus on the solution of the RA 
optimization problem by using a Lagrange dual decomposition method. The solution to the dual function provides an upper 
bound to the primal RA problem. We show, for small cases where it is possible to find the optimal solution to the primal 
problem, that the duality gap is small. We also propose a simple off-line near optimal algorithm which, based on the solution 
obtained from the dual decomposition method, provides a feasible solution to the RA problem. We study several cases where 
we compare the performance of the dual upper bound, the feasible solution provided by the proposed method and a solution 
obtained using weight adjustments in the utility function. The results indicate that the proposed method is close to the upper 
bound while methods adjusting the user weights in the utility function in order to prioritize RT users lead to significantly 
sub-optimal solutions for the OFDMA-SDMA RA problem. 

The paper is organized as follows. We describe the system and formulate the optimization problem we seek to solve in 
Section We present in Section |III] the dual-based method. In Section |IV] we present an algorithm that finds a feasible 
solution based on the dual optimal solution and we present two other alternative methods: one that solves the problem by 
finding the exact solution, the other by using the weight adjustment technique. In Section |V] we compare the performance and 
computation time of our method against the two alterative methods and in Section |VT] we present our conclusions. 



II. System description and Problem Formulation 

We consider the resource allocation problem for the downlink transmission in a multi-carrier multi-user multiple input single 
output (MISO) system with a single base station (BS) serving K users, where some of those users have RT traffic with 
minimum rate requirements while the others have nRT traffic. The base station is equipped with M transmit antennas and 
each user has one receive antenna. In this configuration, the BS can transmit in the downlink data to different users on each 
subcarrier by performing linear beamforming preceding at the BS. At each OFDM symbol, the base station can change the 
beamforming vector for each user on each subcarrier to maximize some performance function. In this paper, we assume that 
capacity achieving channel coding is employed and the data rate units are in terms of bits per OFDM symbol or equivalently 
bits per second per Hertz (bps/Hz). 

A. Signal Model 

First we describe the model used to compute the bit rate received by each user. Define 

Sk,n the symbol transmitted by user k on subcarrier n. We assume that the Sk.n are independently identically distributed 

random variables with Skji ^ CA/'(0, 1). 
w/j „ a M -component column vector representing the beamforming for user k on subcarrier n. Unless otherwise noted, we 

denote w the vector made up by the column stacking of the vectors Wfe,„. 
x„ a M -component column vector representing the signal sent by the array of M antennas at the BS for each subcarrier 

n. 

hfe „ a M -component row vector representing the channel between the AI antennas at the BS and the receive antenna at 
user k for each subcarrier n. 

Zk,n the additive white gaussian noise at the receiver for user k on subcarrier n. The Zk,n are independently identically 

distributed (i.i.d.) and without loss of generality we assume that „ ^ CAf{0, 1). 
yk,n the signal received by user k on subcarrier n. 
r° ,j the rate of user k on subcarrier n. 
The signal vector x„ is built by a linear preceding scheme which is a linear transformation of the information symbols „: 

k 

The signal received by user k on subcarrier n is then given by 

= hfc,„Wfe^„Sfe^„ + ^ hkjiV^j.nSj.n + Zk,n- (2) 

The second and third terms in the right side of (|2]i correspond to the interference and noise terms, respectively. Since the 
signals and noise are Gaussian, the interference plus noise term is also Gaussian and the data rate of user k for subcarrier n 



is given by the Shannon channel capacity for an additive white Gaussian noise channel: 

B. Rate Maximization Problem 

The general rate maximization problem corresponding to the OFDMA-SDMA RA problem with mixed RT and nRT traffic 
is to find a set of beamforming vectors Wfe.„ that will maximize the users weighted sum rate. This is limited by the total power 
available for the transmission at the base station and some users with real time QoS requirements must receive a minimum 
rate. More precisely, we assume that we know 

K Number of users in the cell. 

K. Set of users in the cell: {1, . . . , K}. 

T) A subset of JC containing the users that have minimum rate requirements. We define D = \'D\. 

M Number of antennas at the BS. 

dk Minimum rate requirement for user k. 

N Number of subcarriers available. 

P Total power available at the base station for transmitting over all channels. 
Cfc Weight factors that are used by the scheduler to implement prioritization or fairness. 
We then want to solve the following optimization problem to obtain the resource allocation 

w ^ — ^ ' 

n=l,k=l 

N,K 

||wfc,„||2<P (5) 

n=l,k=l 
N 

$^<„(w)>4, kev. (6) 

n=l 

The power used by the transmitter is represented by the sum of the squared norms of the beamforming vectors in constraint (|5]l. 
The achievable rate over all subcarriers should be higher or equal than the required minimum rate per user as in 

Problem (|4l|6]l is a non-convex, nonlinear optimization problem. Using an exact algorithm to find a global optimal solution 
is very hard considering the size of a typical problem where there can be up to a hundred users and hundreds of sub-channels. 
Another option is to use a standard non-linear program (NLP) solver to compute a local optimal solution and use different 
starting points in the hope of finding a good global solution. The problem with this approach is that 1) we don't know how 
close we are to the true optimum and 2) the technique is quite time-consuming. Albeit those problems, we explored this 
approach and observe that most users end up with a zero beamforming vector and only a small subset of users (< M) actually 
get some rate. Furthermore, in accordance to what was reported for the SDMA problem in 1211 . we observed that at high 
SNR the ZF solution is very close to the local optimum. This ZF solution is easily computed by channel diagonaUzation and 



water-filling power allocation. For these reasons, we now turn to the so-called Zero-Forcing beamforming strategy. 
C. Zero-Forcing Beamforming 

In general, user k is subject to the interference from other users which reduces its bit rate, as indicated by the denominator 
in ([3]). Zero-forcing beamforming is a strategy that completely eliminates interference from other users. For each subcarrier n, 
we choose a set of 5 < M users which are allowed to transmit. This is called a SDMA set. We then impose the condition 
that for each user k in this set, the beamforming vector of user k must be orthogonal to the channel vectors of all the other 
users of the set. This amounts to adding the orthogonaUty constraints 

hfc,„Wj,„ = j ^ k, j,k e (f> (7) 

and the user k data rate for subcarrier n simplifies to: 

rln (wfe,„) = log2 (1 + ||h,,„w,,„||2) (8) 

With zero-forcing, the beamforming problem is now made up of two parts. We need to select a SDMA set for each subcarrier 
and for each selected SDMA set, we must compute the beamforming vectors in such a way that the total rate received by all 
users is maximized. Because of this, we need to add another set of decision variables 

ak,n a binary variable that is 1 if we allow user k to transmit on subcarrier n and zero otherwise. We denote the collection 
of ak,n by the vector a . 
This results in the ZF problem 

N,K 



maxJ7^= V Cfcr^„(wfc,„) (9) 

w,q; ^ — ^ 

n=l,k=l 

N,K 

\\^k.n\\^<P (10) 

n=l,k=l 
N 

I]'^fe,„(wfe,„) > 4, kev (11) 

n=l 

k 

{h.k.n'W-j^nf < B[{1 - ak,n) + - Vu, Vk, k j (13) 

||wfc,„|| < Aak,n (14) 
afc,ne{0,l} (15) 

where A and B are some large positive constants. Constraint ( fTSl i guarantees that we do not choose more than M users for 
each subcarrier and constraint ( fT3] ) guarantees that if we have chosen two users k and j, they meet the zero-forcing constraints 
and that the constraints are redundant for other choices of users. Constraint ( fT4l ) guarantees that the beamforming vector is 
zero for users that are not chosen. It would seem that the zero-forcing model is not improving things much: We have gone 



from a non-convex nonlinear program to a non-convex mixed nonlinear program. However, as we will explain in the next 
section, this allows us to design an efficient and accurate algorithm. 

III. Dual-Based Solution Method 

We cannot solve problem (|9l-[T5]) fast enough to use it for a real time algorithm. Nevertheless, we need to compute solutions 
so that we can use them as benchmarks to evaluate the quality of real time heuristic approximations. We now present an 
off-Une solution technique that is tractable for problems of moderate size and that can give either near-optimal solutions or at 
least a bound when the optimum cannot be reached. 

Solving the zero-forcing problem will require some form of search over the a variables. Note that this ranges over all subsets 
of users smaller than AI x N so that the search space is going to be fairly large. Our first transformation is thus to separate 
the problem into single-subcarrier subproblems. For this, we dualize the constraints ( fTOb and ( fTTI ) since they are the ones that 
couple the subcarriers. Define the dual variables 

A Lagrange multiplier for power constraint ( fTOb . 

fik Lagrange multipliers for minimum rate constraint ( fTTT i of user k. The collection of fik is denoted fi. 
In order to simplify the derivation, we define the dual variables i^ik for all users k E IC. For users with no minimum rate 
requirements (k ^ V), we have fik = 0. In what follows, we use the standard form of Lagrangian duality which is expressed 
in terms of minimization with inequality constraints of the form <. Under these conditions, the multipliers X, fi > 0. We get 
the partial Lagrangian 

"'at 



N,K 

^ = - Cfcr2^„(wfc,„) + A 

n=l,fe=l 



N.K 

)i=i,fe=i 



E 



fJ-k 



= -AP+EMfc4 + E^ - E(^fe+'"''^)^","(^*^-.") + ^Eii^fe'"ii' f ■ (16) 

k n [ k k ) 

The value of the dual function 9 at some point (A, fi) is obtained by minimizing the Lagrange function over the primal 
variables 

9(A, /x) = min£(A, /X, w, ck) (17) 



and the dual problem is 

maxQ{X,n) (18) 

A,/i. 

A,/x>0 (19) 

which we can solve by the well known subgradient algorithm Il22l . From now on, we concentrate on the calculation of the 
subproblem ( fTTI ). 



A. Subchannel Subproblem 

Because of the relaxation of the carriers coupUng constraints, the subproblems in (fT7] i decouple by subcarrier since the 
objective ( fTSI ) is separable in n and so are constraints (fl"2l - [l4l i. Computing the dual function then requires the solution of N 
independent subproblems. For subcarrier n, after dropping the n subscript, this has the form 

miii/„ - - V(cfc + /^fe)r^(wfc) + A V llwfef (20) 

k k 

Y.ak<M, (21) 

k 

(hfew,)' <S[(1 -«,) + (! -a,)], yk,yj,k^j (22) 
\\^k\\<Aak (23) 
ak e {0,1} 

Problem ( 1201423] ) is still a mixed NLP, albeit of a smaller size. 

B. SOMA Subproblem 

A simple solution procedure is to enumerate all possible choices for ak.n that meet constraint ( fT2] i. This is called the 
extensive formulation of the problem. Each such choice defines a SDMA set which we will denote by s and k = \s\. For each 
s, we solve the optimal beamforming problem 

max/„., = V 4 log2 (1 + (hfeWfc)') - A||wfe||2 (24) 

hjWfc =0 j j,kes (25) 

where c^. ~ + Hk- Note that constraint (ISTT i is automatically satisfied by the construction of s, constraint i23[ simply drops 
out since Wfc = for k ^ s and constraint (|22] | remains only for k E s, but we write it as (|25] | because we are considering 
only users that belong to SDMA set s. 

This is certainly not a feasible real time algorithm, but for realistic values of K, say around 100, and M = 4, the number 
of cases is still manageable. This can give near-optimal solutions against which to compare heuristics. This is possible only if 
the SDMA beamforming sub-problem ( 1241425] ) can be solved efficiently. 

C. Beamforming Subproblem 

This is in fact the case because it separates into k independent problems, one for each user in the SDMA set. Here again 
we drop the k index to simplify the discussion. We have to compute the beamforming vector w for a given user For this user, 
we know the set of channel vectors for the other members of s. We denote these vectors by the (k — 1) x M matrix H. We 



also denote the channel vector for the user under consideration by h. The problem is then 

niax/„,,,fe = 4 log2 (l + (hw)') - (26) 
Hw = (27) 

so that for realistic values, this is a small nonlinear program. There are M variables and k — 1 linear constraints. It can 
be solved quickly by a number of techniques. Still, the overall computation load can be quite large. There will be k such 
problems to solve for each SDMA set, and there are S = J2i=i (T) ^^^^ ^^^^ ^^^^ '■^^ ^ subcarriers so that we have 
to solve the problem k x S x N times and this for each iteration of the subgradient algorithm. Clearly, any simplification of 
the beamforming subproblem can reduce the overall computation time significantly. 

D. Approximate Solution of the Beamforming Problem 

This can be done by the following construction. Instead of searching in the whole orthogonal subspace of H as defined 
by ( |27] ). we pick a direction vector in that subspace and search only on its support. This will give a good approximation to the 
extent that the direction vector is close to the optimal vector. The choice of direction is motivated by the fact that the objective 
function depends only on the product hw. We then introduce a new independent variable 

g = hw (28) 

and because this variable is not independent of w, we add Eq. ( |28] ) as a constraint. We then get the equivalent problem 

max/ = c'log. (1 + ^2) _ A||w||2 (29) 
(hw) = q (30) 
Hw = (31) 

which we can rewrite in the standard form Gw = b where the G matrix is the concatenation of h and H and = 
[g,0,0...0]^. 

Since we are proposing to transform the constrained optimization over the k variables into an unconstrained optimization 
over q only, we must be able to express w as a function of q. Since the linear system is under-determined, this is obviously not 
unique. We use G+, the pseudo-inverse of G, for this back transformation. We then have w = G+b. A well known property 
of the pseudo-inverse is that it picks the vector of minimum norm compatible with the linear system. In other words, choosing 
this transformation will minimize |w| so that it is minimizing the power term in the objective function. Because A > 0, this 
has the effect of contributing to the maximization of /. 

Expanding the matrix equation for w, we find that 

Wi = G.^-^q i ^ 1 . . . K (32) 



which can then be replaced in the objective function. We get the unconstrained problem 



maxc' log2 (1 + q^) - \(fr (33) 
1 

where 7 = ||G]^|| and G\ denotes the first column of G+. Changing the variables p = and adding the constraint p > 0, 
we get the equivalent problem 

max c' log(l + p) - A7^p (34) 
p 

p > (35) 



which has the solution 

r c 

so that the computation time is basically the evaluation of G+. 

E. Computing the Dual Function 

To summarize, after reinstating all indices, we can write 



p = max < 0, "If" (36) 



e(A,/x) 



\ fc n J 

/„=max{/„^} (38) 

fn^s ^ ^ fn^s^k (39) 

fees 

fn,s,k = 4log(l 4-p„,s,fe) - A||w„^s,fer (40) 
Pn,s,k = max \ 0, 1 i (41) 

7n,.,fe = l|Gf ||„.,,,fe (42) 
'Wn,s,k = G+ j,b„,s,fc (43) 

hn,s,k = [Pn,s,k,0,---Of (44) 

and G^g ^, is the pseudo-inverse of the concatenation of h„_fc and H„ ^.fe. We denote s*(n), n = 1, . . . , A^, the solution of the 
maximization operation over s in (l38T l. This is the optimal SDMA set for subchannel n for the current values of the multipUers. 
We also denote w* the optimal beamforming vectors for the users k E s*{n). The largest part of the computation to evaluate 
the dual function is the calculation of G^ ^ ^, which has to be done for each subchannel, each SDMA set and each user in 
these SDMA sets. The number of evaluations can become quite large but the size of each matrix is relatively small so that 
the calculation remains feasible for medium-size networks. Another advantage is that while solving the dual problem requires 
multiple subgradient iterations, the calculation of the pseudo-inverses is independent of the value of the multipliers. This means 
that can be done only once in the initialization step of the subgradient procedure. 



For convenience we define $(A,/i,) as the negative of the dual function, 

$ = -e = AP-^Aifc4+^ max (45) 

k n 

and minimize this function when solving the dual problem ( fTsT l. Algorithm [T] finds the optimal dual variables (A*,/x*) that 
solve the dual problem (fTSl l using the subgradient method \22\ with a fixed step size 5 . The optimum value $* is a bound for 
the primal objective and thus for any feasible point of the primal problem (|9]l. If C/^ is the objective achieved by any feasible 
point in the primal problem (|9]l and U* its optimum, the following inequalities hold 1231 

$(A, /x) >$*>[/*> t/^ (46) 

The dual optimum found $*, or any approximation to it, is thus an upper bound to the optimum value of the primal problem 
which can be used to benchmark other solution methods. 



Algorithm 1 Calculation of the dual solution 

Construct the set S of all subsets of users of size 1 < k < M 
for all n = 1 . . .N do 
for all s G 5 do 
for all fc e s do 

Compute the pseudo-inverse G^^ ^ and ^n,s,k 
end for 
end for 
end for 

Choose an initial value A° and 

Subgradient iterations. We set a limit of on the number of iterations 
for all i = 1 . . . I„i do 

Solve the N subproblems dSSl ) 

Compute the subgradients: 

ff^'"' = dk - E« 'rn,k 

5A = E„E.e..(„)l|w:,fe||2-P 
if llg^ll < e and H^aII < e then 

Break 
else 

Update the multipliers 

A'+i = [A' + 5g>] + 

end if 
end for 



F. Performance of the Dual Method 

We study in this section the convergence speed and computation time of the dual algorithm for single random channel 
realizations (the average performance is studied in Section [V]i. We present in Figure [T] the value of the dual function and 
Lagrange multipliers as a function of the number of iterations while the total transmit power and the rate received by user 1 
(single user with minimum rate requirements) are shown in Figure |2] We can see that the method converges very quickly to a 
solution that is both close to the minimum value and feasible. We observed a similar behavior for several other configurations. 



We can see the computation time (i.e., the time used by the CPU) required to solve a problem as a function of the number 
of users in Figure |3] and as a function of the number of channels in Figure |4] In Figure |4] we further separated the CPU time 
between the CPU time required to compute the pseudo-inverse at the initialization and the CPU time required to find the dual 
solution. There is a striking difference since the CPU time growth is much faster than linear as a function of K while it is 
very linear as a function of N . This is obviously because the Lagrangian decomposition separates the overall problem into N 
independent subproblems and for fixed K, the CPU time will grow as the number of subproblems. As a side note, the last 
value plotted in Figure [3] is much lower because for this random channel realization the problem had no active rate constraints, 
thus no extra iterations were required. Another important point however is that calculating the pseudo-inverses is much more 
time-consuming that solving the dual itself. Notice that the two curves in Figure |4] almost overlap but they are plotted on a 
different scale. 



The SDMA set selection and beamforming vectors found by algorithm [T] do not always provide a primal feasible solution. 
The rate or power constraints might be violated whenever the algorithm stops because the number of iterations has been 
reached before the convergence rule is met. In this section we present three different approaches to solve the primal problem. 
In Section IIV-AI a direct method where we enumerate all the variables a. and solve the problem for the beamforming vectors 
w in each case is explained. In Section IIV-BI we propose a simple procedure to obtain a feasible solution from the dual 
solution found with algorithm [T] Finally, in Section IIV-CI we propose a method based on weight adjustments of the utility 
function to meet the minimum rate requirements. In Section IIV-DI we study the difference between the dual-based and the 
weight adjustment methods. 

A. Exact Solution 

One way to evaluate the accuracy of the dual algorithm is to compare it with an exact solution. Problem (I9m4li is a nonlinear 
MIP for which we can do a complete enumeration of the binary variables a., the set of w variables is determined by the choice 
of a particular a. and constraints (fl4l i are automatically satisfied. The same remark goes for Eq. ( fTSl l where the only remaining 
constraints are the ones for which Qfc „ = 1- The ZF constraints are now written as ( fSOl l where s{n) is the SDMA set for 
subchannel n, given by the value of a. For each given value of ct, we then need to solve the optimal beamforming problem 



IV. Other Solution Algorithms 



NJ< 



max 



w 




(47) 



Tl=l,fc=l 



NJ< 




(48) 



n=l,k=l 



N 




(49) 



)i=i 



{hk,n'Wj,n)'' = j, k e s(n), j \fn 



(50) 



where the optimization variables are the beamforming vectors of users in 5(77,) for each subchannel n. The main difference 
with the dual method is that here, we have to enumerate the set of all possible choices of subsets of K users, M antennas and 
N channels. This is a much larger set than for the dual method where the enumeration is done separately for each channel 
and for this reason, we cannot expect to solve very large problems with the exact solution approach. 

Each beamforming sub-problem (l47T i is relatively small but it is not convex. We use the same technique as in Section IIII-DI 
to simplify it. First, we group the user vectors belonging to SDMA set s{n) in a \s\ x M matrix H„ and we assume a given 
1 X |s| user power vector p„. The ZF constraints ( fSOl l are written in matrix form as 

H„W„=diag(Vp;:), Vn (51) 

and we restrict the beamforming vectors to be in the direction given by the pseudo-inverse matrix 

W„ = H„tdiag(Vp;:), Vn. (52) 

The problem then reduces to the optimization over the vector p„ 

N 

m&xU^ = Cfe^log2(l+Pfc,„) (53) 

k£s{n) n=l 

N 

Y f^k,nPk.^<P (54) 
n = l kes(n) 
N 

Y log2(l +Pfc,„) < 4, keV (55) 

n=l 

L J k,k 

Pn > 0. (56) 

Problem (|53H56] l is convex since we are maximizing a concave function over a convex set and can be solved by standard 
techniques. The overall procedure to find an exact solution by enumeration is summarized in algorithm (|2]i. 

Algorithm 2 Enumeration Algorithm 

MAX ^ 

for i = l to S'^ do 

Solve problem ( |53]456] | for given SDMA set assignment Si{n) with objective function 
a > MAX then 
MAX 

end if 
end for 



B. Dual-Based Feasible Solution 

Algorithm [T] can provide an optimal solution (A*,//.*) to the dual problem ( fTSl ). However, the subgradient algorithm is 
known to converge slowly and, in some cases, we need to stop the iterations before the algorithm has reached an optimal dual 



solution. In these cases, the optimal solution may not be feasible. In this section we are proposing a refinement of the dual 
method to construct a feasible solution starting from the final solution found for the dual problem. 

Algorithm [3] summarizes this method. The algorithm begins by solving the dual problem ( flSl l using algorithm [T] If the 
solution is not feasible either directly or by recomputing the power allocation using (I53l456l l for the SDMA set assignment 
found in the dual problem, the algorithm performs a search by increasing the dual variables associated to the users whose QoS 
constraints are not met until a new SDMA set assignment is found. It then solves the power allocation problem ( |53] - l56] l for 
this new SDMA set assignment and checks the solution feasibility with regards to the minimum rate constraints. The search 
for new SDMA sets continues using this method until a feasible SDMA set assignment is found or a maximum number of 
iteration is reached. 



Algorithm 3 Calculating a feasible point from the dual solution 

Solve the dual problem (fTsT l using algorithm [T] This yields the optimal dual variables A* , fij. and a SDMA set assignment 
vector s*{n) for each subchannel n. 
Set sg(n) = s*(n) 

Evaluate total power and user rate constraints (fToHTTT l 
if All constraints are met then 

Exit. A feasible solution has been found, 
end if 

Compute power allocation problem ( |53] - |56] | for sg(n) and evaluate total power and user rate constraints ( fToUTTT i 
if All constraints are met then 

Exit. A feasible solution has been found, 
end if 

Compute the multipliers fik for users k such that < dk 
for j = 1 to J do 

/ifc = Hk + S 

Find s° = argmaxs{/„ where /„_s is given by i39[ for the current dual variables X, fi 
Let s°{n) be the SDMA assignment found 
if s°{n) ^ s°_i(n) then 

We have found a new SDMA assignment 

Compute power allocation problem ( |53] - I56b for s° (n) and evaluate total power and user rate constraints ( fToUTTI ) 
if All constraints are met then 

Exit. A feasible solution has been found, 
end if 
end if 
end for 

Exit. A feasible solution was not found. 



In contrast to the enumeration method described in Section IIV-AI which performs an enumeration of all possible SDMA set 
assignments, the dual-based algorithm |3] is a method that finds a SDMA set assignment close to the dual optimal and then 
uses it to solve one sub-problem ( |53] - |56] |. This makes the search for a near-optimal feasible solution much faster than finding 
the exact solution. 

C. Weight Adjustment Method 

In Section II-AI we discussed several RA algorithms that support RT traffic which increase the user weights in the utility 
function until such users receive transmission resources. In this section we thus propose a weight adjustment method to evaluate 



the efficiency of algorithms that use this approach. The objective of the proposed method is to find a set of weights in the 
utiHty function (|9]l for which, when we solve problem (I9l-[l4li without the rate constraints ( fTTT i. the rate requirements of the 
RT users are met. We also want the set of weights to have the least deviation between users in order to maximize the multi- 
user diversity gain. Algorithm |4] implements a generic method for weight adjustment to achieve this objective. The algorithm 
increases the user weights for RT users until enough resources are allocated to meet the minimum rate requirements. The 
parameter e controls how much the weights are increased with respect to the rate bounds. 

Algorithm 4 Weight adjustment algorithm 

Solve RA problem (|9]l without minimum rate constraints constraints ( fTTT i 
c' ^ c 

Let rfe be the achieved rate for user k at every iteration 
iteration ^ 1 

while (rfc < dk for one or more users k E V) AND (iteration < MAX_iterations) do 

Increase user weight using cj. = c'^ + e {dk — r^) for users in need, where < e < 1 

Solve RA problem (|9]l without minimum rate constraints (fTTT i using user weights c' 

iteration iteration +1 
end while 



D. Comparison of the Weight Adjustment and the Dual-Based Methods 

The rates achieved by weight adjustment algorithm |4] and the dual-based algorithm [3] are different since they solve different 
problems. That is, algorithm |4] can be seen as solving problem ( |53]456] l by a linear penalty method for constraints (|55] l of the 
form 

Pk = min{0,rfe - dk] 

The modified objective function is then 

Up^Y^ CkTk + Pk 
k 

= Yckrk + t [vk-d) (57) 

k\rk<d 

At each iteration of the penalty method, whenever rate constraints are active, the solution of dSTl ) cannot be smaller than that 
of ( |53l - l56] l since it is a relaxation. Notice that problem ( |57] i is quite simple since it has a single constraint ( |54l i but it has to be 
solved many times to adjust the weights of the real time users. In weight adjustment algorithms such as |fT3l . 11741 . the user 
weights are increased at each time slot using an increasing function of the packets delay, so the computation task is distributed 
over time. However, this distributed approach does not guarantee that the rate requirements are met in a given time slot and 
leads to delay violations and jitter. 

To illustrate numerically the difference between solving the problem with explicit rate constraints versus modifying the user 
weights, we simulated the performance of both approaches for a single random channel realizations. We set a minimum rate 
constraint of di = 8 bps/Hz for one RT user We first solved the problem using the dual-based algorithm [3] using equal weights 
for all users. The sum rate is shown with the dash line in the top plot of Figure |5] while the user 1 rate is shown with the dash 



line in the bottom plot of Figure |5] To specifically study the impact of the utility function weights on the performance, we then 
solved problem (I9m4b without the rate constraints ( fTTT ) for different weight assignments as follows: All user weights were set 
to 1 initially and we varied the user 1 weight from 1 to 10. The corresponding user 1 and sum rates are indicated in Figure |5] 
with the star line. We can observe three regions as a function of the user 1 weight ci. For low values of ci, user 1 does not get 
the required minimum rate so that the total rate is much larger that the optimal value, which is to be expected for an unfeasible 
solution. For middle values, the user 1 rate is much better but is not feasible and the total rate is correspondingly smaller but 
still larger than the optimum. Finally, if we increase ci such that the required rate must be satisfied to a high accuracy, the 
total rate is lower than the value obtained with the dual-based method. Note that the total rate and user 1 rate discontinuities 
observed for the weight adjustment method are due to the fact that the selected SDMA set changes as we increase the weight 
and the solution moves from one region to another 

Another example is shown in Figures |6] and Q where we plot the total objective and the user rate, respectively, as a function 
of ci. We consider two cases for the user 1 rate requirement, one where di = 48 bps/Hz and the other where di = 66 bps/Hz. 

The results show that for a value of ci w 1.5 the weight adjustment approach provides the same total rate as the dual-based 
method and the user 1 rate requirement is met. However, it is interesting to note that the range of ci over which this is possible 
is quite narrow: A slightly lower value produces an unfeasible solution and a slightly higher value, a much larger rate allocation 
for the user 1 with a lower total rate. In other words, one would need many iterations of the weight adjustment algorithm to 
get the required accuracy to find the optimum. For the other case, where the rate constraint is higher, we see that the weight 
adjustment method is unable to find a solution that is both feasible and reasonably close to the optimal value. This is a clear 
indication that the rate adjustment algorithm should be used with care, especially in those cases where the rate constraints are 
important. 

These examples show that even if we were to perform an exhaustive search to determine the weights c', the performance 
would not be as good as the dual-based method. For the cases where the performance is comparable, we also observed that 
the weight adjustment method performance is very sensitive to the weight value, which is not a desirable property. However, 
if an efficient method to adjust the weights is used, the computational load of the weight adjustment method would be lower 
than that of the proposed dual-based method because it solves a problem with a single constraint. 

V. Numerical Results 

A. Parameter Setup and Methodology 

In this section, we compare the performance and computation time of the different methods to solve problem (|9]l. We used a 
Rayleigh fading model to generate the user channels and assumed independent fading between users, antennas and subcarriers. 
Unless noted otherwise, we used a configuration with M — 3 transmit antennas, K = 16 users, = 16 subcarriers, and one 
RT user (i.e., D = 1). We also fixed the power constraint to P = 20 and used a large scale attenuation of dB for all users. 
The user weights in ^ were set to = 1 for all users. 

We compared the performance of the different methods for various scenarios where we increased the resource requirements 



for the RT users until the solution was unfeasible (i.e., the minimum rate requirements can no longer be respected for all RT 
users). In the first scenario, we increased the minimum rate dk for a single user with RT QoS requirements. In the second 
scenario we fixed the minimum rate but increased the large scale attenuation of the single RT user to study the case where RT 
users travel away from the BS while still having minimum rate requirements. In the last scenario, we increased the number of 
RT users with the same minimum rate requirements. 

For each scenario and channel realization, the dual solution upper bound was computed using algorithm [T] described in 
Section IIII-EI For small system configurations we also found the exact solution using the primal enumeration algorithm 
|2] given in Section IIV-AI We could only find this exact solution for small configurations because the computation time is 
prohibitive for large system configurations. We also computed the solutions given by dual-based primal feasible algorithm [3] 
and the weight adjustment algorithm |4] described in Section HV-BI and llV-CI respectively. The provided results are the average 
total rate and computation time over 100 independent channel reaUzations. Furthermore, whenever a result is provided for a 
given configuration and algorithm, the solution was feasible with respect to the minimum rate requirements. We also use the 
bound given by the dual optimal solution as a reference given that the exact solution is generally not available except for very 
small size cases. 

B. Total Rate Performance 

First we present in Table U the average gap in percent between the three different methods to find feasible solutions against 
the dual optimal upper bound for a small system configuration with K = 4 users and N ^ 2 subcarriers. We increased the 
minimum rate requirement for one RT user from 13.33 to 20 bps/Hz. As the minimum rate increases, the upper bound decreases 
as more resources need to be assigned to the RT user until the problem is no longer feasible. For this small configuration, 
we see that all methods give excellent results and the duality gap is very small. Also note that due to the solver numerical 
accuracy and the fact that all solutions were close to each other, we even had cases where the dual-based feasible solution was 
better than the solution given by the primal enumeration method. 

In the remaining results we use the larger system configuration with K = 16 users and iV = 16 subcarriers, where it is 
no longer feasible to compute the solution using the primal enumeration method. We present in Table the difference in 
percentage between the solutions of the dual-based feasible algorithm and the weight adjustment algorithm and the dual upper 
bound. The dual-based feasible algorithm provides a solution within 0.25% of the dual upper bound. The weight adjustment 
method, on the other hand, is much worse and the difference can be almost 10%. This is due to the fact that the solution 
found by the weight adjustment algorithm does not change much when the minimum rate is increased, as can be seen from 
Figure [8] which shows the sum rate achieved by the dual-based feasible algorithm and the weight modification method against 
the minimum rate requirement. 

Figure |9] shows the average total rate when the large scale channel attenuation of the RT user varies from to 15 dB. As 
the user moves away from the BS, the RA algorithm dedicates more resources to this RT user until the problem is unfeasible. 
The results show that for all SNR, the dual based method provides a solution much closer to the upper bound than the weight 



adjustment method. Table Hill shows the error in percentage between the objective and the upper bound. For an attenuation of 
15 dB, neither method is able to find a feasible solution. 

Finally, Figure [TO] shows the optimal dual bound and the solution given by the dual-based feasible and weight adjustment 
methods as a function of the number of RT users. Table HV] lists the performance gap against the dual bound in percentage. 
The dual feasible method exhibits a much lower performance gap than the weight adjustment method for all values of the 
number of RT users. Moreover, we can see that the weight adjustment method performance quickly degrades when the number 
of RT users increases. It is not even able to find feasible points when the number of RT users is 6 or 7 while the dual-based 
feasible algorithm provides solutions within 3.52% of the upper bound. 

We can observe from all the results presented in this section (similar trends were also observed for other configurations that 
we studied) that the difference between the dual-based upper bound and the dual-based feasible solution is very small. This is 
an indication that the duality gap is very small and that it is possible to find feasible solutions close to the optimal, albeit with 
an off-line algorithm. Furthermore, the dual-based feasible solution is always better than the weight modification method and 
this difference becomes more significant as the resource requirements to meet the RT users needs increase. This shows that 
the weight modification method should be used carefully for RA in OFDMA-SDMA systems with RT users and that more 
efficient heuristics should be developed to approach the performance of the dual-based feasible solution. 

C. Computation Time 

We compare the computation time of the algorithms based on the time required by an Intel dual-core 660 CPU to solve the 
problem. The primal enumeration method is not included in the comparison due to its computation inefficiency. The average 
computation time to find a solution for the three scenarios (i.e. variation of minimum rate requirement, large scale channel 
attenuation and number of RT users) is presented in Tables to HV] 

In Table [III where the minimum rate constraints are varied, the computation time remains constant for the weight adjustment 
method, requiring an average of two main iterations of algorithm!!] The dual-based method takes on average 3.63 times more 
time than the weight adjustment method. However, the total rate that it produces is much closer to the upper bound. We have 
similar results in Table |III1 where we varied the SNR of the RT user The dual-based method produces a higher total sum rate 
than that of the weight adjustment method at the expense of a larger CPU time. 

Table |IV] shows the computation time as a function of the number of RT users. For the weight adjustment method the 
computation time is almost constant and slightly increases until the number of RT users is five, after which it cannot find 
feasible solutions. In contrast, the dual-based method computation time grows steadily with the number of RT users and it is 
approximately five times higher 

This comparison highlights the main difference between these two methods: While the weight adjustment method performs 
several iterations solving a problem with a single constraint, the dual method takes into account the rate constraints so that 
the computation time increases with the number of RT users. However, the performance gap against the dual upper bound of 
the dual method is very low and approximately constant. In contrast, the performance gap of the weight adjustment method is 



larger and increases with the number of RT users. Moreover, the dual method can find feasible solutions for cases where the 
weight adjustment method cannot. 

VI. Conclusion 

In this work, we proposed a method to compute the beamforming vectors and the user selection in an OFDMA-SDMA MISO 
system with minimum rates for some RT users. We used a Lagrangian relaxation of the power and rate constraints and solved 
the dual problem using a subgradient algorithm. The Lagrange decomposition yields sub-problems separated per subcarrier, 
SDMA sets and users which substantially reduces the computational complexity. We also used a closed-form expression of the 
beamforming subproblem based on a pseudo-inverse condition on the beamforming vectors. The dual function is expressed in 
terms of the dual variables and the dual optimum is found using a subgradient algorithm. The dual optimum can then be used 
as a benchmark to compare against other solution methods and heuristics. 

We then proposed, in addition to the complete enumeration approach which is not computationally practical for normal size 
problems, two algorithms to find feasible solution to the RA problem with minimum rate requirements. The first algorithm 
starts from the dual-based optimum solution and finds a feasible solution by searching along the rate requirement dual variables, 
while the second uses weight adjustments in the objective function to achieve the required rates. Our results show that the 
dual-based method provides solutions much closer to the upper bound than the weight adjustment method. The difference 
is more significant when the SNR of the RT users is low or the number of RT users is high. However, the computational 
load of the dual-based method increases when the number of users with minimum rate requirements increases, whereas it is 
almost constant for the weight adjustment method. This reflects the trade-off between the two methods. The weight adjustment 
method is computationally more efficient but the solutions provided by the proposed dual-based method are much closer to 
the optimal. This indicates that there is an advantage when including the minimum rate constraints in the resource allocation 
problem. In addition, the weight adjustment method requires many time slots to adjust the weights and schedule real time 
users. Our method explicitly includes the minimum rate constraints which allows RT users to be scheduled in the current slot, 
which decreases the average packet delay. 

To implement the RA algorithm in real time, we still need to design fast methods to reduce the number of SDMA sets to be 
searched. The design of these heuristic algorithms is outside the scope of this paper but it is part of our current efforts. Finally, 
the upper bound given by the dual function minimization provides a very useful benchmark to compare the performance of 
these heuristics and the dual-based algorithm can also guide the design of efficient novel heuristics. 
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170 



165 




w Weight adjustment sol 

- - - Dual Feasible sol. - rate constr. =48 

- - - Dual Feasible sol. - rate constr. =66 



10 



12 



RT user weigtit - 



Fig. 6. Total rate for weight adjustment and dual-based methods for M = 3, K = 8, N = 8, and P = 20. 




2 4 6 8 10 12 

RT user weight — > 



Fig. 7. User 1 rate for weight adjustment and dual-based metliods for M = 3, K = 



8, N = 8, and P = 20. 



355 r 




325 - 



320 - 



15 1 , , , , , , , , 

80 85 90 95 100 105 110 115 120 
Minimum Rate requirement (bps/Hz) — > 

Fig. 8. Average total rate as a function of the minimum rate requirements 




Fig. 9. Average total rate as a function of RT user large scale channel attenuation. 



List of Tables 

Average performance gap against the dual optimal upper bound for small system configuration. . . 

Average total rate gap and computation time as a function minimum rate requirement 

Average total rate gap and computation time as a function of RT user large scale channel attenuation 
Average total rate gap and computation time as a function of the number of RT users 



Method 


Minimum rate (bps/Hz) 


13.33 


16.66 


20 


Dual-based upper bound (bps/Hz) 


49.13 


47.12 


40.8 


Primal enum. gap (%) 


0.57 


0.55 


0.10 


Dual-based feas. gap (%) 


0.57 


0.59 


0.04 


Weight mod. gap (%) 


0.68 


0.71 


0.15 



TABLE I 
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Computation time (sec) 
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4.73 
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